978-1-107-04119-6 - The Cambridge Handbook of Learner Corpus
Authors
Abstract
Written and spoken data produced by learners has always been a key resource for the study of second language acquisition (SLA). However, for a long time the data used was rather artificial, i.e. resulting from highly controlled language tasks, and therefore not necessarily a reflection of what learners do in more natural communication contexts. In addition, the data samples were usually quite small, often involving no more than a handful of learners, and therefore raised concerns in terms of representativeness. The combined wish to address these two issues and produce more learner-aware/learner-focused pedagogical tools prompted the emergence of learner corpora, which can be defined as electronic collections of natural or near-natural data produced by foreign or second language (L2) learners and assembled according to explicit design criteria. Learner corpora gave rise to a flurry of studies, which have come to be grouped under the umbrella term of ‘learner corpus research’ (LCR). This new research strand emerged in the late 1980s as an offshoot of corpus linguistics, a field which had shown great potential in investigating a wide range of native language varieties (diachronic, stylistic, regional) but had neglected the non-native varieties. In the case of English, by far the most widely investigated language at the time, this was hardly justified in view of the fact that non-native speakers far outnumber native speakers. Having access to electronic collections of L2 data presents two significant advantages. First, as these collections are usually quite large and are collected from a great number of learners, they are arguably more representative than smaller data samples involving a limited number of learners.
Second, being in electronic format, the data can be analysed with a whole battery of software tools that greatly speed up the analysis and enable a wide range of investigations that either cannot be performed manually at all or only at huge cost in terms of human resources. Part-of-speech taggers, for example, assign to each word in a learner
Similar resources
Connecting NLP and Language Learning
Detmar Meurers (2012). Natural Language Processing and Language Learning. Encyclopedia of Applied Linguistics, edited by Carol A. Chapelle. Blackwell. 4193–4205. Detmar Meurers (2015). Learner Corpora and Natural Language Processing. The Cambridge Handbook of Learner Corpus Research, edited by Sylviane Granger, Gaëtanelle Gilquin and Fanny Meunier. Cambridge University Press. Luiz Amaral ...
Cambridge University Press 978-1-107-06718-9 - The Cambridge Handbook of Meeting Science
Acton, B., 618 Adams, L., 671 Agnew, C. R., 326 Aksoy-Burkert, F., 7 Albon, R., 526 Allen, J. A., 3–6, 7, 8, 9, 10, 12–17, 49–50, 59, 70, 158–159, 165, 307, 310–311, 373, 382, 523, 620, 738, 740 Allison, B., 9 Ambady, N., 461 Amrhein, P., 417, 424 Anderson, A. H., 133, 134, 135, 664–665 Anderson, N., 525, 589 Andriessen, J. H. E., 668 Anson, R., 51 Arafeh, J. M., 624 Argote, L., 365 Arrow, H., ...
The Cambridge Learner Corpus - error coding and analysis for lexicography and ELT
The Cambridge Learner Corpus is a 16 million-word corpus of Learner English collected by Cambridge University Press in collaboration with the University of Cambridge Local Examinations Syndicate (now Cambridge ESOL). It comprises English examination scripts, transcribed retaining all errors, written by learners of English with 86 different mother tongues. The scripts range across 8 EFL examinat...
Hedges in English for Academic Purposes: A Corpus-based study of Iranian EFL learners
Hedges, as tools to express tentativeness and doubt, have been studied in plenty of research papers in the Iranian EFL research setting. However, their use in a learner corpus, portraying Iranian learner English, is in need of more research attention. With this end in view, this study aimed at investigating how Iranian EFL learners who have majored in English-related fields in Iran deployed hed...
Metadiscourse Markers in a Corpus of Learner Language: The Case of Iranian EFL Learners
Different issues have been probed in learner corpus research since the late 1980s. However, taking the importance of metadiscourse markers (MDMs) in signposting academic discourse, their use in Iranian EFL learners' academic essays is an area of research in need of a more serious analysis. Contributing to this line of investigation, this paper reports a corpus-based study of the use of MDMs i...
Publication date: 2015